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Abstract. It is one of the wonderful "coincidences" of the theory of finite groups that the simple 
group G of order 25920 arises as both a symplectic group in characteristic 3 and a unitary group 
in characteristic 2. These two realizations of G yield two G-covers of the moduli space ) of 
configurations of six points on the projective line modulo PGL2, via the 3- and 2-torsion of the 
Jacobians of the double and triple cyclic covers of branched at those six points. Remarkably 

We 



these two covers are isomorphic. This was proved over C by transcendental methods in |HW 



give an algebraic proof valid over any field not of characteristic 2 or 3 that contains the cube roots 
of unity. We then explore the connection between this G-cover iS of ) and the elliptic surface 
2/^ = a;^ + sextic(t), whose Mordell-Weil lattice is Eg, with automorphisms by a central extension 
of G. 

0. Introduction. The moduli spaces of the title all cover the moduli space ( ^ ) of unordered 
sextuples of distinct points on modulo the action of PGL2, or equivalently of sextic 
polynomials S{t) without repeated roots modulo the action of GL2. [A quintic counts as 
a sextic with a root at infinity; polynomials of lower degree would have repeated roots at 
infinity and are thus excluded.] To a configuration of six distinct points on are associated 
two curves C : = S{t) and C : — S{t) of genus 2 and 4, which are cyclic covers of 
of degrees 2 and 3 branched at those six points. Two of our covers of ( g ) are obtained by 
adding full level-3 structure to C and full level-2 structure to C"; that is, they are the moduli 
of genus-2 curves C with a choice of generators for the 3-torsion subgroup of the Jacobian 
J(C), and of curves C" with a choice of generators for the 2-torsion subgroup of J(C"). The 
Weil pairing gives these torsion subgroups the structure of four-dimensional spaces over the 
finite fields F3 and F4, with respectively a symplectic and a unitary structure; thus the 
corresponding moduli spaces are normal covers of ) with Galois groups PSp4(F3) and 
SU4(F2). It is one of the wonderful "coincidences" of the theory of simple finite groups that 
these two groups of order 25920 coincide; let G be the finite group isomorphic with both 



PSp4(F3) and SU4(F2). (See page 26 of the ATLAS [|C&| for this identification and for and 
further properties of G that we shall use.) It is a remarkable fact that the two G-covers 
of ) are also isomorphic. This is proved in |IIW| (see also Ch.5]) by transcendental 



methods: first replace (^ ) by the moduli space P^g^ of ordered sextuples of distinct points, 
which is an cover of ( ^ ) ; then regard both G-covers of P^g^ over C as quotients of the 
complex 3-ball by arithmetic groups acting freely; and prove that they are isomorphic by 
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computing enough invariants. Our two G-covers of ) are then quotients of these G-covers 
of P^g-j by the same action of 5*6, are thus isomorphic as weU. Of course this leaves completely 
mysterious the algebraic meaning of the identification between the two moduli spaces. 

We obtain this identification algebraically via a third moduli space S: the space of sextic 
polynomials S{t) together with all representations of S as the difference between the square 
and the cube of polynomials of degree at most 3 and 2, which we shall call minimal rep- 
resentations of S{t) as — x'^. By counting parameters we may surmise that the forgetful 
map from 5 to ) is a Galois cover, but it is not at all clear what the group should be. 
We prove that the group is G by constructing algebraic maps from S to the two geometric 
G-covers of ) and showing that the maps are isomorphisms. This of course yields an alge- 
braic isomorphism between the J(G)[3] and J(G')[2] moduli spaces. We then give a further 
interpretation of G as the group of automorphisms of a Z[e^"*/'^] lattice in isomorphic 
as a Euclidean lattice with Es, which arises here as the Mordell-Weil lattice of the rational 
elliptic surface 

£^£s:y^^x^ + S{t). (1) 

The 240 minimal vectors of this lattice correspond 2 : 1 with the 120 odd elements of order 2 
in the Jacobian of G'; they correspond 3 : 1 with the 80 nontrivial elements of order 3 in the 
Jacobian of G; and they correspond 6 : 1 with the 40 representations of S as the difference 
between the cube of a quadratic and the square of a cubic polynomial. Even the fact that 
there are always 40 such representations is far from well known, though it turns out that 
Clebsch [ |c| had already obtained this enumeration and, in collaboration with Jordan, also 
its relation with what we now call J(G)[3]. 

After this paper was largely completed I found that the identification of the J(G)[3] and 
J(G')[2] moduli spaces via the minimal solutions of = a;'^ -|- S{t) was already obtained by 
van Geemen in ivCJ, and in essentially the same way. He was led to it via the study of theta 
functions. The interpretation of this result in terms of the Mordell-Weil lattice of and 
the connections with other geometric ideas given or announced here, still appear to be new. 

Acknowledgements. Thanks to Daniel AUcock for bringing Q to my attention, and to 
Allcock and Joe Harris for helpful discussions. Jordan EUenberg read an earlier version of this 
paper and found several instances of confusing or mistyped prose whose correction markedly 
improved the exposition. 

This work was made possible in part by funding from the David and Lucile Packard Foun- 
dation. 

1. The sind Ai^ pictures of ) and their G-covers. Let fc be a field not of 
characteristic 2 or 3 containing a cube root of unity p. To six distinct points ti, . . . , on 
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'P^{k) permuted by Gal(fc/fc) we associate two curves defined over k: the genus 2 curve 

6 

C:u'^^S{t) = eY[{t-tj) (2) 

i=i 

up to quadratic twist e e k* /k*'^, and the genus-4 superelhptic curve (caUed a "Picard curve" 



in iHWj ) 
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C ^S{t)^eY[{t-tj) (3) 

up to cubic twist e £ k* /k*^. If some tj = oo the corresponding factor t — tj is replaced by 1. 
Except for the twists e, e, these curves do not depend on the choice of coordinate on P^(fc): 
changing t to (at + b)/{ct + d) changes Yiji^ " ^j) 



fat + b atj +b\ f ad — bc\^ yr t ^ tj 

\ct + d ^ ctj + d J ^ \ ct + d J jj^ ctj +d' ^ ' 



in which the factor ((ad — bc)/{ct + d))^ may be absorbed into or v^, while the constant 
factor 11^=1 (^j + is absorbed into e or e. The reader may check that in the special cases 
tj — oo or ctj + d — the formula ^ still holds with our interpretation of a factor "t — oo" 
as 1, and the factor ctj + d replaced by — c if tj — oo and by —{atj + 6) if ctj + d — 0. 

It is well known that every curve of genus 2 is of the form C for some choice of tj and e, 
uniquely determined by the curve up to the action of PGL2(fc), and that every principally 
polarized abelian surface which is not a product of elliptic curves is the Jacobian of such a 
curve. In other words, it is known that (^ ) is identified with the moduli space M2 of curves 
of genus 2, and with the open subset of the moduli space A2 of principally polarized abelian 
surfaces, namely the subset parametrizing indecomposable surfaces. 

It is also known, though not as widely, that the curves C yield another pair of moduli 
interpretations of (^ ). Just as the genus-2 curves C have the hyperelliptic involution {t, u) ^ 
{t, —u), the genus-4 curves have an automorphism of order 3 defined by 

Q{t,v)^{t,pv). (5) 

The fixed points of this involution are the six points above t = tj. By the Riemann-Hurwitz 
formula, an order-3 automorphism of any curve of genus 4 has six fixed points if and only if 
the quotient curve has genus 0. Now there are two kinds of genus-4 curves cyclically covering 
with degree 3: the curves C", and curves of the form 



i=l 
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The two kinds of curves are distinguished by the action of g on the four-dimensional space of 
holomorphic differentials on the curve. In both cases the fixed subspace is necessarily trivial 
because it is the space of holomorphic differentials on the quotient curve. But the action of g 
on H^{C') is diagonalized by the basis {dt/v^,tdt/v^,t'^ dt/v'^,dt/v), the first three of whose 
vectors have eigenvalue p and the last has eigenvalue p; whereas the p- and p-eigenspaces of the 
holomorphic differentials on (|6|) both have dimension 2. It follows that over an algebraically 
closed field the curves C" are precisely those genus-4 curves with an order-3 automorphism 
g whose 1-, p-, and p-eigenspaces have dimensions 0, 3, 1. When k is not algebraically closed 
we must also impose the condition that the quotient curve have a rational point (and thus be 
identified with P^)J3 We thus have a straightforward identification of ) with a space we 
shall call Aif, the moduli space of genus-4 curves C" with an order-3 automorphism g whose 
action on H^(C) has eigenvalues p,p,p,p. Less obvious, but still true, is that the Jacobian 
map A^4 — > Aa identifies with an open subset of the moduli space yl| of what we shall 
call "g-fourfolds" : principally polarized abelian fourfolds with an order-3 automorphism g 
acting on the tangent space with eigenvalues p, p, p, p. This is seen by computing that yl| 
has dimension 3, the same as the dimension of A^|. |IIW| 

Now each of A2 and A^^ has natural arithmetic covers: the moduli spaces of principally 
polarized abelian surfaces or gi-fourfolds with additional (torsion) structure. We will thus 
obtain two families of natural covers of (^ ). For instance, for each prime p the moduli space 
of principally polarized abelian surfaces with full level-p structure — that is, with a choice 
of generators for their p-torsion — is a Galois cover of A2 with Galois group PSp4(Fp) = 
Sp4(Fp)/{±l} if the ground field contains thep-th roots of unity. [Not all of GL4(Fp) because 
the Galois group must respect not only the group structure but also the Weil pairing on the 
p-torsion; we divide by {±1} to account for the quadratic twists.] When p = 2, a full level-2 
structure on J{C) is just an ordering of the six Weierstrass points, because the 15 nontrivial 
2-torsion points are represented by the differences between pairs of Weierstrass points; thus 
we obtain the cover of ( ^ ) by the moduli space Pj^g^ of ordered sextuples of distinct points on 
modulo PGL2. Of course the Galois group of this cover is the symmetric group S'g, so we 
have recovered the identification of this group with the symplectic group Sp4(F2). Likewise 
we obtain Galois covers of A^^ from the p-torsion points of fourfolds, with the Galois group 
this time depending on whether p is ramified, inert or split in Q(p). For the ramified prime 3 
we obtain an intermediate cover from the 3-torsion points in the kernel of \/— 3 = g — g- 
These constitute a 4-dimensional space over F3; the Weil pairing yields a quadratic from 
on this space, taking a (\/— 3)-torsion point P to the pairing of P with any of the 3-torsion 
points P' such that P = \/—3P'. This quadratic form turns out to have Arf invariant 1, 



^If we worked over a field k not containing p we would also have to require tiiat the involution in Gal(fc(p)/fc) 
take g to ■ 
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so we obtain a cover of with Galois group S0^(F3). Again the (-\/— 3)-torsion points on 
J{C') are generated by the differences between the six points with t = tj (note that these 
are fixed by g, so in the kernel oi 1 — p = py/—3), so once more we find the cover of ) 
by P^gj, whose Galois group 5*6 is this time identified with SO4 (F3). (Sec [C&, p. 4] for the 
realizations of Sq as linear groups in characteristics 2 and 3.) 

The next cases are p = 3 for A2 and p = 2 for A^. We already know that the former yields 
a cover of A2 with Galois group PSp4(F3) ~ G. As to the latter, the even prime is inert in 
Q(p), so the 2-torsion points of J{C') have the structure of a four-dimensional vector space 
over F4. The Weil pairing is an alternating F2-bilinear map on that space consistent with 
the F4 structure (that is, such that {P,P') = {gP, gP'))\ this yields a natural unitary form 
on J(C")[2], namely v : P ^ {P,gP). Thus the Galois group for the cover of A^ by the 
moduli space of ^^-fourfolds with a full level-2 structure is U4(F2)/F4 — SU4(F2) is again G. 
(We divide by the center F| to account for the cubic twists; since the dimension 4 is coprime 
to 3 = #F4, the resulting group PU4(F2) is isomorphic with SU4(F2). So we have again 
obtained, from two different curves C, C associated with a six-point configuration in and 
full level structures of different levels, covers of (^ ) with the same Galois group. But, unlike 
the case of the S'g-cover of ( ^ ) by Pj-g^ , it is not immediately clear here that the two G-covers 
of (^ ) are the same. This is what Hunt and Weintraub proved transcendentally in |HW|, 
and what we next prove algebraically by identifying both spaces with the space of sextics 
S{t) together with all minimal solutions {x, y) £ k[t] x k[t] of y'^ = + S{t). 



2. Torsion divisors and S. The action of GL2 on sextic polynomials S{t) also respects 
the minimal solutions of y"^ = x^ + S{t). Indeed, substituting {at + b)/{ct + d) for t in 
y"^ — x^ + S{t) and multiplying by {ct + d)^ yields y\ = x\ + Si{t), where xi,yi,Si are the 
images of x, y, S under (^^) G GL2. Thus the set of minimal solutions of = a;'^ -I- S* makes 
sense even if S is not a sextic polynomial but a GL2-orbit of such polynomials, or a point 
on (^ ). In the sequel we assume for simplicity that no tj = 00, i.e. that S{T) is of degree 6, 
not 5. The argument can be readily adapted to handle the case of a quintic; alternatively 
we may make all the tj finite by first applying a PGL2(fc) transformation, which as already 
observed does not change the problem. Note that since k 3 p and k is not of characteristic 2 
or 3 there are always more than six points in P^(fc) so such a transformation must exist. 

Theorem 1. Let y e k[t] be a polynomial of degree at most 3 sucii that y^ — S{t) is a 
k* -multiple of a cube in k[t]. Then the rational function y{t) — u on G has divisor 3D for 
some nonprincipal divisor D. Each nonzero 3-torsion element of J{C) arises as the class of 
such a divisor D for exactly one choice of y. 
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Remark: This theorem, and its following proof, turn out to have already been given (in older 
terminology, and only over C) in |cj]. 

Proof: On C we have S{t) = u^, so ax^ ^ — S{t) factors as ax^ = {y — u){y + u). Now 
y, S have no common roots: at such a root x^ would vanish, so it would be a root of x as 
well; but then S = y^ — ax^ would vanish there to order at least 2, which is not allowed. 
Thus y, u have no common zeros on C, and u has a triple pole at each of the points at infinity 
while y has a pole of order at most 3 there, and neither y nor u has any pole at a point of C 
with t ^ oo. Thus the y ^ u and y + u have no common zeros nor any finite poles, and at 
each point at infinity at least one of ?/ ± u has a triple pole while the other has a pole of 
order < 3. Now x is regular except for poles of order at most 2 at the points at infinity. 
Since (y + u){y — u) = ax^ , we conclude that the divisor of y + u has the form 3D for some 
divisor D on C. Since 3D is principal, the class [D] e J{C) of _D is a 3-torsion point on the 
Jacobian. But D cannot itself be principal. Indeed, if it were, {y + uy/^ would be a rational 
function on C of degree at most 2. But all such functions are in k{t). Thus y + u would also 
in k{t), which is absurd because ye k{t) but u ^ k{t). Thus [D] is a nonzero 3-torsion point 
in the Jacobian, as claimed. 

Now let the divisor K on C be the sum of the two points at infinity; this is the divisor of the 
differential dt/u on C, and is thus canonical. Since D includes at worst simple poles at the 
points at infinity, D + K is an effective, non-canonical divisor of degree 2. Conversely, let T 
be a nonzero 3-torsion point in J{C). Then + r is a non-canonical divisor class of degree 2, 
and so by Riemann-Roch has a unique effective representative, which we may call K + D. 
Since D ^ T, the divisor 3D is principal, say the divisor of / e (fc(C))*. Since 3K + 3D is 
effective, this function / has at most triple poles at the points at infinity, and no other poles; 
thus it is of the form yi{t) + cu where yi is a polynomial of degree at most 3 and c (z k. 
If c vanished then / would be a rational function of t which, when considered as a function 
on C, would have all its zeros and poles of order divisible by 3. But then the same would be 
true of / as a function on P^, because the cover i : C — > has degree prime to 3. Thus / 
would be a fc*-multiple of the cube of a rational function — and this rational function would 
have divisor D, which would therefore be principal, contradicting the hypothesis that T ^ 0. 
Thus f = yi + cu for some c E k* , and multiplying by we find the unique function of 
the form y + u whose divisor is 3D for some D ^ T. It remains to prove that — S{t) is 
a fc*-multiple of a cube. Since the divisor of y + m is divisible by 3, the same is true of the 
divisor of y — u, which is the image of y + u under the hyperelliptic involution. But then the 
same is true of (y ^ u){y + u) = y^ — S{t)^ considered as a function on C. As noted already 
this means that the divisor of y^ — S{t) considered as a function of t is also divisible by 3, so 
y^ — Sit) is indeed of the form ax^ for some x £ k[t] and we are done. 
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Corollary: i) There are 3^ — 1 = 80 polynomials y E k[t\ of degree at most 3 such that 
y^ — S{t) is a cube in k[t]. 

ii) iS is a G-cover of ), isomorphic with the moduli space of curves of genus 2 with full 
level-3 structure. □ 

Much the same analysis applies to the curves C . The one difference here is that the divisor 
Kq, consisting of the sum of the points (now three of them) at infinity, is no longer canonical: 
the differential dt/v"^ has divisor 2Kq, not Kq. Thus Kq is a distinguished semicanonical 
divisor, a.k.a. theta characteristic. (As with the divisor of K on G, the divisor Kq depends 
on the choice of coordinate t on P^, but the linear equivalence class of Kq does not: if we 
used instead a coordinate whose pole is at r 7^ cx3 our divisor would be the fiber of r under 
the map t : C ^ , and the difference between these two Kq^s would be the divisor of 
the rational function t — r.) Using Kq we may identify 2-torsion elements T of the Jacobian 
with theta characteristics Kq + T. Recall that there is an affine-linear quadratic form on 
the theta characteristics which gives the parity of the space of sections; for genus 4, there 
are (2^ + 2^^)72 = 136 even and (2® - 2'*)/2 = 120 odd theta characteristics. The divisor 
Kq has the two-dimensional space of sections generated by 1 and so it is an even theta 
characteristic. None of the other 135 even thetas 8 has any nonzero sections: if it did then 
by parity it would have at least 2 independent ones, whose ratio would be a degree-3 map 
to not of the form (at + b)/{ct + d): but there are no such rational functions on G'. The 
same argument shows that each odd theta has only a one-dimensional space of sections, and 
thus has a unique effective representative. 

We are now ready to state the analogue of Theorem 1 for G': 

Theorem 2. Let x E k[t] be a polynomial of degree at most 2 such that — S{t) is a 
k*-multiple of a square in k[t]. Then the rational function x{t) + v on G' has divisor 2D for 
some nonprincipal divisor D such that [D + Kq] is an odd theta characteristic. Each odd 
theta characteristic arises in thus way for exactly one choice of x. 

Proof: On G' we have S{t) = v^, so Py'^ = x^ + S{t) factors as (3y'^ ~ (x + v){x + pv){x + pv). 
Now X, S have no common roots: at such a root y^ would vanish, so it would be a root of y as 
well; but then S — /3y^ — x^ would vanish there to order at least 2, which is not allowed. Thus 
x, V have no common zeros on G', and v has double poles at each of the points at infinity 
while X has a pole of order at most 2 there, and neither x nor v has any pole at a point of G' 
with t ^ 00. Thus, of the three factors x + v, x + pv, and x + pv of Py'^, none has a pole 
except at one of the points at infinity; each has poles at infinity of order at most 2; at each 
point at infinity at most one factor may have a pole of order < 2; and no two have a common 
zero. Now y is regular except for poles of order at most 3 at the points at infinity. Since 
(x + v){x + pv){x + pv) = /3y'^, we conclude that the divisor oix + v has the form 2D for some 
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divisor D on C . Since 2D is principal, the class [D] € J{C') of D is a 2-torsion point on the 
Jacobian. But D cannot itself be principal. Indeed, if it were, [x + v)^/"^ would be a rational 
function on C of degree at most 3. But all such functions are in k{t). Thus x + v would 
also in k(t), which is absurd because x G k{t) but v ^ k{t). Thus [D] is a nonzero 2-torsion 
point in the Jacobian, as claimed. Moreover [D + Kq] is an odd theta characteristic, because 
D + Kq is an effective semicanonical divisor not equivalent to Kq. 

Conversely, let T be a nonzero 2-torsion point in J(C") such that Kq + T is an odd theta 
characteristic, and let Kq + D he the unique effective divisor in the class of Kq + T. Since 
D i^T, the divisor 2D is principal, say the divisor of f G {k{C'))*. Since 2K + 2D is effective, 
this function / has at most double poles at the points at infinity, and no other poles; thus it 
is of the form Xi(t) +cu where xi is a polynomial of degree at most 2 and c € k. If c vanished 
then / would be a rational function of t which, when considered as a function on C", would 
have all its zeros and poles of order divisible by 2. But then the same would be true of / as 
a function on P^, because the cover t : C ^ has degree prime to 2. Thus / would be 
a fc*-multiple of the square of a rational function — and this rational function would have 
divisor D, which would therefore be principal, contradicting the hypothesis that T ^ 0. Thus 
f = xi + cv for some cG k*, and multiplying by we find the unique function of the form 
x + v whose divisor is 3D for some D ^T. It remains to prove that + S{t) is a A;*-multiple 
of a square. Since the divisor oi x + v is divisible by 2, the same is true of the divisors of 
X -\- pv and x -\- pv, which are the images of .x + w under g and . But then the same is true 
of (x + v){x + pv){x + pv) = x^ + S{t) considered as a function on C . As noted already 
this means that the divisor of x^ -\- S{t) considered as a function of t is also divisible by 2, so 
x^ + S{t) is indeed of the form for some y G k[t] and we are done. 

Corollary: i) There are 120 polynomials x S k[t] of degree at most 2 such that x'^ + S{t) is a 
square in k[t\. 

ii) <S is a G-cover of (g ), isomorphic with the moduli space of curves C with full level-2 
structure. □ 

Combining the Corollaries to Theorems 1 and 2 we obtain 

Theorem 3. i) Any polynomial S{t) of degree 5 or 6 without repeated roots over an alge- 
braically closed field not of characteristic 2 or 3 can be written as — x^ for 240 pairs of 
polynomials x{t),y{t) of degree at most 2 and 3 respectively. 

ii) Over any field not of characteristic 2 or 3 that contains the cube roots of unity, the moduli 
space of curves C with full level-3 structure and the moduli space of curves C with full level-2 
structure are isomorphic G-covers of ). 

Proof : i) This follows from part (i) of the Corollary to either Theorem 1 or Theorem 2 since 
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240 = 3 • 80 = 2 • 120. (Fortunately the two computations agree!) 

ii) This foUows from parts (ii) of the Corollaries to Theorems 1 and 2, which identify both 
G-covers with S. □ 

The reader may have already surmised that the distinction between even and odd theta 
characteristics is equivalent to the unitary structure on J(C")[2]. Indeed a theta characteristic 
Kq + T is even according as v{T) = or 1. This can be proved as follows. Let q be the 
afhne-quadratic form on the theta characteristics Q which is on even and 1 on odd 0's. 
It is known Q that q is compatible with the Weil pairing: if T, T' are any 2-torsion points 
then their Weil pairing {T,T') (written additively to take values in F2 rather than {±1}) is 

q{Q) + q{e + T) + q{e + T') + 9(6 + T + T') (7) 

for any choice of 8. Now take Q — Kq and T' — qT. We have seen already that q{Ko) — 0. 
Moreover q is gi- invariant. Thus (|^) reduces to 

q{Ko +T)+ q{Ko + qT) + q{Ko + g^T) = 3q{Ko + T) = q{Ko + T). (8) 

Thus q{KQ +T) = (T, gT) = viT) as claimed. 

3. The lattice E^. Our proof of Theorem 3 does not entirely dispel the mystery of 
the coincidence of the two G-covers of (^ ); even the fact that PSp4(F3) and SU4(F2) are 
isomorphic appears to emerge as an accident — in fact formulate our analysis so that the 
isomorphism PSp4(F3) ^ SU4(F2) arises as a by-product! But there is a more satisfactory 
approach to this isomorphism via the low-dimensional representations of G and its double 
cover 2.G = Sp4(F3). (The existence of 2.G cannot be so readily seen from the SU4(F2) 



model of G: except for in characteristic 2 have trivial Schur multiplier |C&, p.xvi].) A four- 



dimensional representation V of 2.G will also figure in a third description of our G-cover 



5 of (^ ) which will naturally yield both the J(G)[3] and J(G')[2] pictures of S via the 
reduction of V mod 2 and 3 respectively. 

It is known that Aut(G) contains G with index 2. When G is viewed as PSp4(F3), the outer 
automorphisms are linear transformations that multiply the symplectic form by —1, while in 
the SU4(F2) viewpoint they are conjugate-linear transformations preserving the unitary form. 
Thus Aut(G) appears as PGSp4(F3) in characteristic 3 and as SU4(F2) in characteristic 2. 

We take for V one of the two irreducible representations of 2.G of dimension 4; the choice 
does not matter, because these representations (which are each other's contragredient) are 
exchanged by an outer automorphism of G. This representation is defined over Q(p); the 
image of 2.G in GL(V") (actually SL(4)), extended by the three-element group (p) of /.tg- 



multiplcs of the identity, is a complex reflection group — #32 in the list of [3T, p. 301] 
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generated by what the ATLAS caUs "triflections" : hnear transformations of order 3 with a 
codimension-l fixed subspace. The nontrivial central element of G acts on V by multiplication 
by —1; thus the exterior square W = A^V is a. 6-dimensional representation of G. It turns 
out that this representation is irreducible and defined over Q. This time W is the unique 
representation of its dimension; thus the action of G on W extends to Aut(G). It turns out 
that the image of Aut(G) in GL(W^) is again a reflection group, #35 in |ST| , which is to 
say the Weyl group W{Eq). Kneser observed in Q that reducing the action of Aut(G) on 
the Eq lattice mod 3 and 2 identifies Aut(G) with PGSp4(F3) and SU4(F2) in these groups' 
guises as the orthogonal groups S05(F3) and SOg (F2). Indeed Eq/SEq is and Ee/2Ee are 
orthogonal spaces of dimensions 5 and 6 over F3 and F2 respectively, both with actions of G 
respecting the quadratic form; but the simple groups S05(F3) and SOg (F2) are barely large 
enough to accommodate copies of Aut(G), so the reduction maps from Aut(G) to these two 
groups must be isomorphisms. In particular the isomorphism between S05(F3) and S0g^(F2) 
follows. Kneser explains most of the sporadic isomorphisms between classical simple groups 
in the same fashion. But he docs not observe an alternative explanation along the same lines 
of the isomorphism PSp4(F3) = SU4(F2) using a 2.G-lattice E^ C V; and it is this lattice, 
and its reductions mod 2 and 3, that are central to our story.S 

We call the lattice Eg because it is obtained from the Es root lattice by choosing a 3-cycle 
in W{Es) acting on E^ with trivial fixed space. There is a unique conjugacy class of such 
3-cycles; identifying one with p gives Eg the structure of a four-dimensional lattice E^ over 
Z(p). Its group of automorphisms is the subgroup of W{Es) commuting with p, which is 
isomorphic with (2.G) x (p). We choose the isomorphism so that E^ (g) Q(p) is our repre- 
sentation V of 2.G rather than its contragredient. We can also describe E^ explicitly as the 
sublattice of Z[p]^ consisting of vectors congruent mod (V— 3 ) to a linear combination of 
(1, 1, 1, 0) and (1, —1, 0, 1), i.e. to a vector in the "tetracode" in Fg. (That is, E^ is the Z[p]- 
lattice obtained from the tetracode by "Construction Ac" JC^ , p. 200, Example lib].) The 
inner product (z, 2*) of z = (zi, Z2, Z3, Z4) with z' — {z[, Z2, Zg, Z4) is | J2'j=i ^-'p z' G E^ 
then (z, z") € (2/V— 3 )Z[p]. The roots (nonzero vectors of minimal norm) are the 2-3-4 = 24 
multiples of unit vectors by ±fj,^\/—3, and the 3'^(3^ — 1) = 216 minimal lifts of the 8 nonzero 
vectors of the tetracode. (Each such vector has 1 zero and 3 nonzero coordinates; in a minimal 
lift, lifts to 0, and ±1 lifts to one of the three choices ifJ.^-) This adds up to the familiar 
count of 24 + 216 ~ 240 roots. For each root r we obtain a triflection x x + ^{p~ l){x, r)r, 
and these triflections generate Aut(£'g) = (2.G) x (p). Reducing mod 2, we obtain an F4- 
vector space Eg/2Eg of dimension 4 with a Hermitian form ^(•, •) mod 2. Thus the group 

■^Remarkably G is involved in yet another complex reflection group, the 5-dimensional #33 defined over 
Q(p). The reductions of this representation mod 3 and 2 again yield the isomorphism PSp4(F3) = SU4(F2). 
We do not pursue this here because this representation does not enter into our investigation of S. 
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Auti(£'g) of linear isometries of of determinant 1 maps to a subgroup of SU4(F2); the 
kernel consists of the identity and multiplication by —1, so we find Auti(G)/{±l} as a 
subgroup of SU4(F2). Reducing mod (y/^), we obtain an Fa-vector space E^/y/^E^ of 
dimension 4 with a symplectic form \/^(-, •) mod {\/^^). Thus Auti(-Eg) maps to Sp4(F3), 
this time with trivial kernel (since the scalar multiplications by p, p are excluded from Auti). 
We thus find Auti(G')/{±l} as a subgroup of PSp4(F3). Since SU4(F2) and PSp4(F3) are 
barely large enough to accommodate Auti(£'g)/{±1}, these two inclusions must be isomor- 
phisms. We have thus explained the isomorphism of SU4(F2) with PSp4(F3) by regarding 
these two groups as the mod-2 and mod-3 manifestation of the characteristic-zero object 
{Eg,2.G). This explanation extends to Aut(G'): the outer automorphisms oi 2.G act on E^ 
by conjugate-linear isometries such as {zi, Z2, z^, Z4) 1-^ (^1,^2,^3,^4). Such maps descend 
to conjugate-hnear automorphisms of E^/2Eg, and thus extend SU4(F2) to SU4(F2); on 
Eg/\/—3E^, they act linearly but reverse the symplectic pairing, thus extending Sp4(F3) to 
GSp4(F3) and PSp4(F3) to PGSp4(F3). Therefore SU4(F2) Aut(G) = PGSp4(F3). 

~3. Since ^(r, r) ~ 1 for every 



We next consider the reduction of the 240 roots mod 2 and \ 
root r, all the roots belong to one of the 120 odd classes in E^/2E^\ clearly r and — r arc in 
the same class, and conversely if r, r' are roots congruent mod 2E^ then r' — ±r because at 
least one of ^(r ± r') is a lattice vector of norm < 2. But there are 240 = 2-120 roots; since 
each odd class mod 2E^ accounts for at most two of them, each of the 120 odd classes must 
be represented by a pair ±r of opposite roots. Modulo %/— 3, no root is congruent to zero 
and every root r is congruent to r, pr, and pr, and to no other roots r' lest 

2 



4 = 



r — r' 


2 
+ 


pr — r' 


2 

+ 


pr — r' 










V=3 



(9) 



be the sum of three positive even integers. Thus each of the 80 nonzero classes in E^/ 
contains at most 3 roots, and since there are 240 = 3-80 roots we again conclude that each 
nonzero class represents a triple n^jj- of roots. Of course these counts 240 = 3 - 80 = 2 - 120 
are highly suggestive of the counts in the first parts of the Corollaries to Theorem 1 and 2; 
we make the connection in the next section. Note that we could also have obtained these 
results from the identification of Aut(i?g )/{±l} with SU4(F2) and PSp4(F3), together with 
the fact that the unitary and symplectic groups act transitively on odd and nonzero vectors 
respectively. 

We conclude our description of the lattice E^ by noting that it could also have been defined 
directly from the representation of 2.G on V, without recourse to the Eg, root lattice: the 
representation is globally irreducible, and thus has a unique (2.G)-stable lattice, see j^, [Th[ . 



4. The elliptic surface £ and its Mordell-Weil lattice. We next identify E^ with the 
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group of fc(f)-rational points on the elliptic curve 

a;^ + S{t) (10) 

for each sextic S € k[t\ without repeated roots. To do this we must show not only that 
£{k{t)) is a free abelian group of rank 8 but also specify an action of p and a quadratic form 
and an action of p on that group. We will have p act by complex multiplication (CM), and 
take for the quadratic form the canonical height h on the points of an elliptic curve over a 
function field. We describe these extra structures (CM and h) on £(k{t)) in turn, and then 
show that the resulting Z[p] -lattice is isometric with E^. 

That £ has complex multiplication by Z[p] means that Z[p] acts on £ by endomorphisms 
(a.k.a. isogenics, i.e. algebraic maps commuting with the group law). Giving £ a Z[p] action 
means exhibiting an endomorphism p satisfying the minimal equation p^ + p + 1 = 0, with 
addition defined via the group law on £. Such an endomorphism is p : {x,y) i-^ {px,y): the 
three images {x, y), (px, y), [px, y) of a generic point (.t, y) on £ under 1, p, p^ have the same 
t/-coordinate, and thus are the intersection of £ with a line and sum to zero in the group law. 
We chose (x, y) ^ {px, y) rather than [x, y) ^ {px, y) so that p multiplies by p the invariant 
differential oj = dx/y on £; hy linearity it follows that = w for every endomorphism 
(j) e Z[/9]. We note for future use that any ^ e Z[p], considered as a map ^ : £ ^ £, has 
degree (j)4>- Every elliptic curve in characteristic other than 2 or 3 that has CM by Z[p] can 
be written as y'^ = + ae for some nonzero ae, and so becomes isomorphic with 

Eo -.V^ = + 1 (11) 

once we extract a sixth root of 05. Thus in our case £ becomes isomorphic with the constant 
curve Eq when we extend k{t) to the function field of the curve 

C" : = S{t). (12) 

This is a curve of genus 10 (a smooth plane sextic) which cyclically covers both C and C 
in degrees 3 and 2 respectively. The identification of £ with Eq over k{C") takes {x,y) 
to {w~'^x,w~^y). Thus the fc(t)-rational points of £ are identified with the subgroup of 
£o{k{C")) consisting of the fc(C")-rational points P of Eq whose image under the generator 

of Gal{k{C") / k{t)) taking w to —pw is —pP. Note that £a{k{C")) may be regarded as the 
set of maps P : C" — > Eq, which inherits an abelian group structure from the group law 
of Eq; in this viewpoint £{k{t)) consists of those maps P for which the diagram 

C" £0 

i i (-P) (13) 

C" Eo 
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commutes using {t,w) t-^ {t,—pw) for the arrow C" — s- C" . Likewise we may regard the 
fc(t)-rational points of £ as maps from C" to Eq defined over k for which that diagram 
commutes. 

We next review the canonical height on £\ see Ch.VIII§9 (pp. 227-233)] for the canonical 



height on a general elliptic curve over a global field, and |CZ, 3h. M for the present case of 



an elliptic curve, especially one of constant j-invariant, over a function field. 

Let E be any elliptic curve over a function field F = k{Ca), with Cq a curve over the field 
of constants k. Choose a Weierstrass model for i5, with coordinates x, y. The naive height 
on is a function h on its group E{F) of rational points whose value at a point measures the 
point's complexity: the zero point has naive height 0, and a nonzero point {x, y) has naive 
height 

2 

:= max(degx, -degy). (14) 

(Here "deg" is the degree of an element of the function field considered as a rational map 
to P^. Note that h depends on the choice of Weierstrass model for E, though the naive 
heights associated with different models differ only by 0(1). The canonical or Neron-Tate 
height on the curve does not depend on the choice of model. It is a Q-valued quadratic form 
h on E{F), positive-definite on E{F) ®z; R, which is within ±0(1) of the naive height h. 
These two properties uniquely characterize h. It follows that if Ci is a curve covering Cq 
with degree d, and F' /F the corresponding field extension with F' — k{Ci) and [F' : F] — d, 
then for any P E E{F) its height hpi{P) as a point of E{F') is d times its height hp{P) as 
a point of E{F). Indeed, from the definition (^4|) it is clear that the naive heights hp,hpi 
satisfy hp' — dhp, so the restriction of d^^hpr to E{F) satisfies both criteria for hp. 

There are other equivalent characterizations more suitable for computing h. Tate showed that 
for any endomorphism (j) : E E oi degree > 1 the canonical height is the unique function 
h : E{F) R such that h = h + 0(1) and h{aP) = {dega)h{P) for every P £ E{F). 
(Usually one takes for cj) the multiplication- by-n map for some small n > 1 such as n = 2, 
but we'll be able to exploit ^ Z as well.) Neron described h in terms of intersection theory 
on E considered as an elliptic surface over the constant field of F, which yields a formula 
for h{P) — h{P) as a finite sum of terms each depending on the reduction of P at one of 
the places where E has bad reduction. From either characterization it follows that if E is 
a curve defined over k, so points P E E{F) are equivalent to rational maps P : Cq E, 
then the height of such a point equals twice the degree of P; that is, the canonical height 
equals the naive height. (Since x, y arc functions of degree 2, 3 on i?, the definition (|l^) gives 
h{P) = 2deg(P).) In Tate's viewpoint we see this by noting that h already satisfies the 
condition h{aP) — {deg a)h{P), because the degree of rational maps is multiplicative under 
composition. In Ncron's approach this follows because there are no places of bad reduction 
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and thus no contributions to the sum for h — h. 

In our case ( p^ ) of the curve £ over k(t) or k{t) we Ukewise show that h = h: 

Proposition. The canonical height of any nonzero P e £(k{t)) with coordinates {x,y) is 
given by h{{x,y)) — max(dega:, | degy); that is, the naive and canonical heights on £{k{t)) 
are equal. 

As in 1^ we can give several proofs: one using the relationship with maps C" Ei, one 
using Tate's characterization, and one using Ncron's formula. We give the first and third 
proof, leaving the second as an exercise; each of the three introduces ideas that will later 
figure in our determination of £{k{t)) and its connection with S. The following observations 
regarding the nai've height will be helpful throughout. Let P : {x{t),y{t)) be a nonzero point 
on £{k{t)). Let r,- e P^{k) be the points at which either x or y has a pole, with to — oo. For 
j > 0, since S — y^ — x^ is regular at tj g fc, the functions x^, must have poles of the same 
order at t^, so x,y ave poles of order 2nr, 3n,. at for some integer n^. At to — oo, y'^ — x^ 
has a pole of order 5 or 6, so at least one of x^,y^ has a pole of order 6, and if one has a pole 
of order > 6 then both do and the pole orders are equal. In this last case x,y have poles of 
order 2rio, 3no at t — oo for some integer hq > 1; otherwise we define uq = I. Then 

h{P) ~ niax(degx, - degy) = 2 ^ n^. (15) 
o 

r 

We proceed to our proofs of the Proposition: 

Proof 1: Extending k{t) to k{C") we have identified £ with the constant curve Eq and 
associated to P : {x,y) the rational map {t,w) i-^ {w~''^x{t),w''^y{t)) from C" to Eq. We 
calculate the degree of this map by finding the preimages of the origin of Eq and their 
multiplicities. We find that the preimages are the points with t — Tr for some r, with 
multiplicity Gn^ if = tj for some j and rir if not. In either case contributes Qrir to the 
total, because there are six points of C" with t = unless = tj in which case there is only 
one. Thus the degree of the map is Q^^n^ — 3h{P), and its height as a point of £{k{C")) 
is twice that degree, or 6h{P). Since [k{C") : k{t)] = 6 we divide by 6 to obtain the claimed 
formula h{P) — max(dega;, | degy) for the height of P as a point of £{k{t)). 

Proof 2 (sketch): Use Tate's characterization, taking for (f> the isogeny 

.3x3-4y2 9x3y-8y3 
V-3 = p-p: (x,y) ^ ( ,V-3 ) (16) 

or 

9^4^^ 27a;6 - 9a; V + y^ 

2 i-^y) - ( ' 8? ^''^ 

That is, show that these multiply the nai've height by 3 and 4 respectively. This can be done 
directly by counting poles of the coordinates of {x, y) and 2 • (x, y). (One can also invoke 
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the computation in Proof 1 together with the fact that and 2, considered as isogenics 

of Eo, have degrees 3 and 4 respectively.) 

Proof 3: In Ncron's approach the terms in the formula for h — h are indexed by places where 
E reduces to a decomposable curve. We will show that £ as no such places and thus that 
h — h ~ 0. Specifically, we'll show that the singular reductions of £ are six cubics = x^, 
with Kodaira type II indicating a cusp singularity at the origin but only one component, 
at the roots tj of S{t). This is clear for finite t, where £ is smooth as a surface (since the 
tj are distinct) so the reduction of the Ncron model is obtained by just specializing t, with 
no further blowing up necessary. At < oo the coefficient S{t) has a pole, so we change 
coordinates to {x' ,y') — {x/P ,y/t^) satisfying y'^ — x'^ + {S{t)/t^), which at t = oo is again 
smooth as a surface and so reduces to a cuspidal cubic if some tj = oo and to an elliptic 
curve if not. □ 

Corollary: i) The canonical height of every point P E £(k{t)) is an even integer, which is 
positive unless P is the zero point. 

ii) The points P of height 2 are those coming from minimal solutions of y"^ = x^ + S{t). iii) 
The torsion subgroup of £{k(t)) is trivial. 

Proof: i) By the Proposition it is enough to prove this for the naive height. The height of 
the zero point is 0. In ([l5| ) we exhibited the naive height of a nonzero point P as a sum of 
nonnegative even integers 2nr, including the positive 2nQ. Thus h{P), and so also h(P), is a 
positive even integer as claimed. 

ii) If ~ ^ then no = 1 and all other rir = 0. Thus x,y have no poles at finite t, and 
have poles of orders at most 2, 3 at i = oo. Conversely it is clear that if x, y are polynomials 
of degrees at most 2, 3 with y^ = x^ + S then P : (x,y) is a rational point on £ with h{P) = 2, 
and thus by the Proposition also with h{P) = 2. 

iii) If P is a torsion point then nP = for some n > 1, so h{P) — n^^h{nP) ~ 0. By part 
(i) it follows that P = 0. □ 

On general principles, the group of fc(t)-rational points of £ is finitely generated. The Propo- 
sition and its Corollary tell us that this group is torsion-free and has the structure of an even 
Z[p]-lattice L5, a.k.a. the Mordell-Weil lattice of £, in the unitary space {£{k{t))) ^z[p] C = 
Ls ®z[p] C, whose minimal nonzero points are the minimal solutions of y'^ = x^ + S{t). We 
already know that there are 240 such, so the following theorem should come as no surprise: 

Theorem 4. For every sextic polynomial S{t) without repeated roots, the lattice Ls is 
isomorphic with E^. 

Proof: It is enough to prove that Ls is isomorphic with Es as a Z-lattice, because E^ is 



15 



Eg with its unique structure as a Z[p]-lattice. We determine L5 as a Z-lattice by relating 
it with the Neron-Severi group NS(£) with £ considered as a surface over k. To do this we 
need a model of £ smooth everywhere, including t = 00. But we have obtained such a model 
in the course of proving the h = h proposition. This model is the union of the two open 
subsets t ^ 00 and i 7^ 0, each of which is represented as a surface in x P^: for t ^ 00, 
using coordinates t, [x : y : 1); for i 7^ 0, using [x/t^ : y/t^ : 1). We saw already 
that this surface has no reducible fibers and that its only singular fibers are reductions of 



Type II at the six points t — tj. It follows as in |Sh] that £ is a fc-rational elliptic surface, 
so that its Neron-Severi group is the even unimodular lattice of signature (1, 9), of which the 
zero section and the fibers contribute a unimodular sublattice of signature (1, 1) ("hyperbolic 
plane"), and therefore the Mordell-Weil lattice is even unimodular of rank 8. Thus Ls = -Eg 
as claimed. □ 

Remark: We could also have proved that Ls has Z[p]-rank 4 using the description of L5 
as maps to Eq from C" , or equivalently with its Jacobian J{C"), that make (|3|) commute. 
Now the for < s < 6 the (— /3)^-eigenspace of the action of (i, w) i— > (t, — pw) on H^{C") 
consists of the differentials P{t) dt/w^^'^ with P E C[t] of degree at most 4 — s, and thus 
has dimension 5 — s. For s = 3 these are the differentials pulled back from H^{C) via the 
triple cover C" — > C; for s ~ 2 and s = 4 they arc the puUbacks of H^iC') via the double 
cover C" C . Thus Ls consists of the annihilator in Hom( J(C"), i?o) of the images of 
J{C), J{C') in J{C"). The quotient of J{C") by the abelian variety generated by J(C) and 
J(C") is an abeUan variety Jo(C") of dimension g{C") - g{C) - g{C') = 10-4-2==4 
equipped with an endomorphism inherited from (t, w) i— > (t, ~pw) that multiplies all its 
holomorphic differentials by —p. Thus Jo{C") is an abelian fourfold isogenous with Eq, and 
the rank of Hom( Jo(C"), i?o) equals that of IIom(i?Q , £'q) = Z[p]'^. Moreover, at least in 
characteristic zero we see that Jo(C"), and thus the lattice Ls, must be independent of S, 
because all components of the moduli space of fourfolds isogenous with Eq have genus zero. 
But it seems much harder to prove that Ls ^ E^ using this approach. 

By Theorem 4 the lattice of fc(t)-points on £ is independent of the choice of S. However, the 
field extension ks of k needed to define the points of £{k{t)) does depend on S. It is clear that 
this extension is finite, because £(k{t)) is generated by finitely many points. Moreover it is 
a normal extension, because Galois conjugation in ks permutes the solutions of the equation 
(|l|) defined over k. Furthermore, Gal{ks/k) respects the group law and canonical height 
on Ls- Thus we may regard Ls as a Ga^fc^/fc) module, and obtain a map from Gal(fc5/fc) 
(or even from Gal(fc/fc)) to Aut(Ls) — Aut(ii^g) = /Xg x (2.G). We have seen already that, 
at least modulo the center /Xg of Aut(£'g), this Galois representation depends only on the 
PGL2(fc)-orbit of S; in the Ls viewpoint, this is because if S, S' are sextics equivalent under 
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some g G PGL2(fc) then g : ^ lifts to an isomorphism from £3 to £s', using the 
models of these ehiptic surfaces constructed during the proof of Theorem 4. Our work in 
§2 shows that the representation Ga\{ks/k) ^ G is equivalent to the Galois representations 
on J(C)[3]/{±1} and J(C")[2], and is surjective onto G for generic S. But the curves C,C' 
also figure in the arithmetic of £ as an elliptic curve over k{t): their function fields are the 
extensions of k{t) generated by the (\/— 3)- and 2-torsion points of the curve. This is clear 
from our formulas (|l|,|7l) for the isogenics ^—3, 2 : £ £: the nontrivial (^—3 )-torsion 
points are those with x — and thus y = ±\/S = ±u, while the nontrivial 2-torsion points 
have y = and x = —\fS = —p^v. The 3- and 2-torsion groups of J{C) and J{C') then 
arise naturally in the descent on £ via the isogenics a/— 3 and 2, and allow us to extend 
the maps of Theorems 1 and 2 from the 240 roots to J(C)[3] and J'{C)[2] to isomorphisms 
Ls/ Ls ^ J(C)[3], Ls/2Ls ^ J(C")[2]. We thus obtain our final explanation of the 
equivalence between the G-covers of ) coming from J(G)[3] and J'{C)[2]: as with the 
mod-3 and mod-2 manifestations of G itself, the two constructions of the G-cover 5/ ( g ) are 
now revealed as the mod-(\/^3 ) and mod-2 manifestations of the projective representation 
of Gal(fc//c) on Ls- We next exhibit these descent maps; see g Ch.VIII] for the general 
theory of descent of which they are two special cases. 

We deal first with the 3-isogeny \/— 3. We observed that its kernel is generated by the 3-torsion 
point (0,u) with v? = S{t). This the function y — uE {k{C)){£) is a Weil function on £: its 
divisor is 3((0, — 3(0). Moreover y — u is locally a cube at its triple pole. Evaluation oiy — u 
mod cubes at points other than and (0, u) then extends to a homomorphism from £(k(C)) 
to k{C)*/k{C)*^ whose kernel is ^/^£(fc(G)). We claim that y/^£{k{C)) n £ik{t)) = 
3 £{k{t)): otherwise we have Q — \/~iQi with Q e £{k{t)) but Qi defined not over k{t) 
but only over k{C); but then the other preimages of Q under \/— 3 would be the translates 
of Qi by (0,±m), which are also defined over k{C), and we would obtain a cubic extension 
of k{t) split by the quadratic extension k(C)/k{t), which is impossible. Thus the restriction 
of our homomorphism from £(k{C)) to L5 = £{k{t)) yields an injection from Ls/^/—3 L{S) 
to k{C)* /k{C)*^ . Since the divisor of j/ — u on £ is divisible by 3, and £ has no reducible 
fibers, the divisor of y{t) — u on G is also divisible by 3; call it 3D. (This generalizes the 
argument of Theorem 1, and can also be shown as was done there without explicitly invoking 
the fibers of £, though the key point that S has distinct roots amounts to the same thing; 
in the context of descent the condition that the zero or pole multiplicity of y{t) ~ u he a, 
multiple of 3 at each point (tg, uq) of G is the condition of local triviality at tg of a principal 
homogeneous space for the isogeny Then [D] is a 3-torsion point on J{C), and the 

map {x, y) ^ [D] is an injection from Ls/\/^L{S) to J(G)[3]. It is clear that the restriction 
of this descent map to a root agrees with the nonzero element of J(G)[3] associated with the 
root in Theorem 1. 
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The kernel of the isogeny 2 is not cyclic, though its nonzero elements (0, — v^) = {0,—v), 
(0, —pv), (0, —pv) are permuted by p. The Weil fimctions for these points are x + v. x + pv. 
X + pv] their product x'^ + = x^ + S{t) = t/^ is a square. Thus evaluation of any two of 
these functions, say x + v and x -\- pv, at points outside £[2] extends to a homomorphism 
from £{k{C')) to (A;(C')*/fc(C")*^)^ whose kernel is 2£{k{C')). Better yet, we can evalu- 
ate all three functions and land in (k{C')* /k{C')* by which we mean the subgroup of 
(k{C')* /k{C')* ) consisting of triples whose product is the identity. We again claim that 
2£{k{C'))r\£{k{t)) = 2£{k{t)): otherwise we have Q = 2Qi with Q e £{k{t)) but Qi defined 
not over k{t) but only over k{C'): but then the other preimages of Q under multiplication 
by 2 would be the translates of Qi by {0,—p^v), which are also defined over k{C'), and we 
would obtain a biquadratic extension of k{t) split by the cubic extension k{C')/k{t), which 
is impossible. Thus the restriction of our homomorphism from £(k{C')) to Ls = £{k{t)) 
yields an injection from Ls/2Ls to (k{C')* /k{C')* Moreover this action is consistent 
with the action of p: substituting px for x cyclically permutes x + v, x + pv, x + pv modulo 
squares (indeed modulo /X3), and thus agrees with cyclic permutation of the three coordi- 

2\ 3 

nates of [k{C')* /k{C')* j^. Thus the image of our map is contained in the subgroup of 

/ 2 \ 3 

[k{C')* /k{C')* )g consisting of triples of the form {f,p*f,p*f), and we may consider it as 
an injection of F4-modules Ls/2Ls ^ (fc(C")*/fc(C")*^). As in the previous paragraph we 
see that each of the three functions x + p^w has divisor divisible by 2, and thus identify the 
image of our map in {k{C')* / k{C')*'^) with J(C")[2]. Once more this is the descent map on £ 
over k{t), this time for the multiplication-by-2 isogeny, and it is clear that the restriction of 
this descent map to a root recovers the odd element of J{C')[2] associated with the root in 
Theorem 2. 

To summarize: we have shown 

Theorem 5. The maps from the 240 representations of S{t) as xp' — to J(C)[3] and 
J(C")[2] obtained in Theorems 1 and 2 extend to surjective homomorphisms from Ls to 
J{C)[3] and J{C')[2] whose kernels are \/^Ls and 2Ls respectively; these homomorphisms 
are the descent maps for the isogenics and 2 on £. The moduli space S is the moduli 
space for sextics S{t) equipped with an isomorphism of Ls with E^; reducing mod \/— 3 
and 2 recovers the J(C)[3] and J(C")[2] constructions ofS. 

5. Complements and coming attractions. 

Orbits of the point stabilizer and the height pairing. The simple group G acts transitively on 
the 40 representations of S as the difference between the cube of a quadratic and the square 
of a cubic polynomial; but its action is not doubly transitive: the orbits of a point stabilizer 
have sizes 1, 12, 27. It is easy to see this from either the mod-3 or the mod-2 definitions 
of G: in each case the choice of a line in a symplectic F3 or an odd line in a unitary F4 
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divides the remaining 39 such hnes into 12 orthogonal to the first and 27 not orthogonal to it. 
But Clebsch already obtained the 12 + 27 partition in though he knew neither the Weil 
pairing on J(C)[3] nor the relevance of C . How, then, could he distinguish the orthogonal 
from the non-orthogonal pairs? In effect he did it using the height pairing! Given a root 
r e i?g, the 6 • 39 roots not proportional to r consist of 6 • 12 orthogonal to r and 6 • 27 
not orthogonal to it; if r, r' are roots neither proportional nor orthogonal to each other then 
their inner product (r, r') is one of /Xg • 2/ ^/—i. Now {r,r') is determined by the norms of 
r + r', r + pr' , and r + p^r' , which are even integers whose sum is 12 (cf. (^). If r, r' are 
not proportional then these norms are positive and < 8; clearly (r, r') = if and only if each 
norm equals 4, and so (r, r') 7^ if and only if the norms are 2, 4, 6 in some order. In the Eg 
picture, r, r' are minimal solutions {x{t),y{t)) and {x'{t), y'{t)) of y"^ = -\- S, and r + p^r' 
is a solution of y"^ = x^ + S whose coordinates are obtained from [x, y) and {p'^x' , y') by the 
group law on £ . The degrees of these coordinates as functions of t then determine the norm 
of r + p^r' via our "/i = ft," Proposition. The coordinates of r + p'^r' involve the slope quotient 
(y — y')/{x — p'^x'), and thus hinge on the distribution of the zeros oi y — y' among the zeros 
of a; — p'^x' . (Since y^ — y'^ — x^ — x'^ ^ the zeros of the three quadratic polynomials x — p^x' 
are the same as the zeros of the two cubics y ± y'.) We find that (r, r') = exactly when 
each of a; — p^x' contains just one of the linear factors of y — y', which is exactly Clebsch's 
condition for distinguishing 6 • 12 of the remaining 6 • 39 minimal solutions. 

Cubic curves tangent to six given concurrent lines. The problem of enumerating solutions of 
y^ — x^ + S(t) has also appeared in |Ty|, where they arose in yet another geometric guise: the 



forty cubic curves tangent to the six lines t = tj in the (s,t) plane, concurrent at the point 
(1 : s : i) = (0 : 1 : 0) at infinity. The connection of these cubics with J(C)[3] is then seen as 



a special case of the construction of the last section of [E&]. We explain these connections 
next. 

On a generic plane cubic C in the (s, t)-plane, i is a rational function of degree 3, and thus 
has six ramified points, each with one simple and one double preimage on C, corresponding to 
the six lines through (0:1:0) tangent to C. Now fix six distinct points tj E and consider 
curves C of genus 1 with a degree-3 map t : C ^ ramified at the tj. By the Riemann 
existence theorem for branched covers of P^, such curves are in 1:1 correspondence with 
6-tuples of involutions in the symmetric group ^3 whose product is the identity and which 
generate a transitive subgroup of 6*3, modulo conjugation in the subgroup they generate. 
Ignoring the transitivity condition and the conjugations we find 3^ — 243 such 6-tuples: the 
first five involutions may be chosen arbitrarily, and uniquely determine the sixth. Of those 
243, the transitivity condition excludes only those for which all six involutions are the same, 
leaving an adjusted total of 243 — 3 = 240. Each of these must generate all of S3, so our final 
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count of triple covers of P-*^ ramified at t — tj is 240/3! = 40. 

We can directly relate these 40 covers with with two of our three pictures of S. First, they 
naturally biject with solutions of y'^ = + S{t) in polynomials x{t),y{t) of degrees at most 
2,3, up to equivalence {x,y) ~ {fi^x,±y). Second, they naturally biject with pairs ±D of 
nontrivial 3-torsion points on the Jacobian of C : u'^ — S{t). 



The latter bijection was given in |E&|; more generally, for any distinct points ti, . . . ,^29+2 
in P^, a bijection was constructed between genus- (g — 1) triple covers of P^ simply ramified 
at the tj and pairs of nontrivial 3-torsion points on the Jacobian of the genus-(7 hyperelliptic 
curve — 0^=1^ ~ ^j)- This curve is the discriminant of the triple cover; its compositum 
(a.k.a. fiber product over P^) with the triple cover is an 5*3 cover of the line which is an 
unramified cyclic cubic cover of the hyperelliptic curve. By geometric Kummer theory such 
covers of any curve correspond bijectively with pairs of nontrivial 3-torsion points on the 
curve's Jacobian. For a hyperelliptic curve one readily recovers from its cyclic cubic cover 
a triple cover of P^ ramified at the curve's Weierstrass points. Given the tj there are thus 
(3^ — l)/2 such triple covers, whose coefficients generate a field extension of generic Galois 
group contained in PSp2g(r3). The count of (3^ — 1)/2 can also be obtained as in the previous 
paragraph by solving equations in S3. Before [E&| it was already known [ Coh that the Galois 
group is PSp2g(F3), but only by combining a transcendental monodromy computation with 
a difficult group-theoretical characterization of the permutation representation of PSp2g(F3) 
on P^s^^(F3). In |E&| the bijection with 3-torsion in the Jacobian was used to explain the 
symplectic structure in terms of the Weil pairing on the 3-torsion. 

On the other hand, it is noted in | poh that from C and t : C ^ P^ one may construct a 
solution of y^ ~ + S{t). Any function of degree 3 on a curve C of genus 1 is the quotient 
of two sections of a divisor D of degree 3; by Riemann-Roch, D has a three-dimensional 
space of sections, which identifies C with a cubic curve in the (s,t) plane up to affine-linear 
transformations preserving t, i.e. up to transformations of the form (i, s) 1— > (t, as + bt + c) . 
Using these transformations we put our cubic in the form — 3x{t)s + 2y{t) — where the 
polynomials x, y are of degree at most 2 and 3 respectively. This form is unique up to scalings 
(s, X, y) (As, \^x, \^y). The ramified points are the values of t at which the discriminant of 
this cubic in s vanishes, i.e. the roots of the sextic y^—x^. Scaling by A multiplies y^—x^ by A^. 
Thus there is a A, unique up to multiplication by /Xg, that yields y^ — x^ — S{t). Conversely, 
from a solution to y"^ — x^ — S{t) we recover a plane cubic C : s'^ + 2>x{t)s + 2y{t) = with 
a degree-3 cover f : C — > P^ ramified at the tj, with two solutions {x,y) producing the same 
curve if and only if they are equivalent under {x,y) ^ (fi^x^dzy). At this point in |Coh|, 
Cohen cites Clebsch for the enumeration of solutions of y^ ~ x^ ^ S{t); indeed |Coh| was the 
paper that alerted us to Clebsch's work on this problem. It is easy to see that the pair of 
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3-torsion points of J(C) constructed from ip — = S{t) in Theorem 1 is the same as the 
pair obtained from the cubic s"^ + 3x{t)s + 2y{t) = using the construction of |E& . 



Ground fields without roots of unity. We have assumed throughout that our ground field k 
contains the cube roots of unity. If we work over a field, such as Q, that does not contain 
them but still has characteristic other than 2 or 3, then we can still define the moduli spaces 
) and S, but the latter space's function field must contain /I3. This can be seen from each 
of our three pictures of S: in the J(C")[2] and Mordell-Weil pictures, via the action of p on 
C and £; in the J(C)[3] picture, via the Weil pairing. [For the latter, it might seem that the 
sign ambiguity in J(C)[3] may frustrate the extraction of a cube root of unity from the level-3 
structure. But the only ambiguity is the possibility of multiplying all of J(C)[3] by —1, and 
if P, Q are 3-torsion points whose Weil pairing {P, Q) is p then (— P, —Q) = p also.] The field 



p 

^ 6 

inducing the Galois involution of k(p)/k 



extension k(^S) of fc((^ )) then has Galois group Aut(G), with the outer automorphism of G 



Twists, and G vs. Aut(i?g). From a sextic S{t) €E k[t] without repeated roots we find, from 
the fiber of S above the associated point of ), a normal extension k' of k of degree at 
most 25920 and a map Gal(/c'/fc) G. But this k' is not the field of definition ks of Lg: 
for each minimal solution {x,y) of y"^ — x^ — S{t) the coordinates of x^,y'^ are contained 
in k', but not necessarily the coordinates of x,y themselves. Indeed it might seem that to 
obtain k' from k we might need to extract a different sixth root for each of the 40 minimal 
representations of S(t) as the difference between a square and a cube. But in the picture 
it is clear that a single cyclic sextic extension suffices to obtain ks from k': the group G is the 
quotient of Aut(£'g) by its center //g. Note, however, that the extension ks/k' is not visible 
at the level of moduli spaces: multiplying S" by A ^ does not change the associated point 
of ) and thus gives rise to the same extension k', but multiplies each of x^,y'^ by A and 
thus twists the extension ks/k' by \/X. Clearly ks/k' is the compositum of two extensions 
of fc', the first obtained by adjoining the coefficients of the x-coordinates of the roots, the 
second obtained by adjoining the coefficients of the y-coordinates. The former extension is 
either trivial or cyclic cubic, and the latter is either trivial or quadratic. Since the double 
cover 2.G of G does not split, the quadratic cover, though normal over k, cannot be obtained 
from ks by adjoining the square root of an element of k. On the other hand the index-2 
subgroup of Aut(i?g) does split as /X3 x G, so the subfield of ks obtained by adjoining each 
root's X coefficients is the compositum of k' with a cyclic cubic extension of k. It might be 
surmised that this extension is k{y/ c/S.{S)) for some c € Q*, where A(S') is the discriminant 
of 5'; since A(A5) = A^"A(5), this surmise would behave correctly under scaling of S. In a 
future paper we shall show that in fact k{yK(S)) (with c = 1) is the correct extension. 

Geometry of S and its fiber products with ( g ) and ( ) . We have given three descriptions 
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of the moduh space S but have not described its geometry. Burkhardt had aheady in 
effect identified S with an open set in his quartic hypersurface in P*, which is the zero locus 
of the degree-4 invariant of the five-dimensional representation of G mentioned in footnote ^ 
see also [Q, p. 190]. This hypersurface is rational [Q, p. 184-5], a fact Hunt attributed to Todd 
(1936), with an explicit birational map to P'^ first given by Baker six years later. The fiber 
product over ) oi S with P^-g^ — that is, the moduli space of genus-2 curves C with full 
level-6 structure, or equivalently of curves C with full level- (2\/~3) structure — is an open 
set in an algebraic threefold of general type |HW|. Recall that this moduli space, call it S' , 
arose in the course of Hunt and Weintraub's identification of the G-covers of ) coming 
from J(G)[3] and J(G')[2]; it is an 5*6 cover of S. We have investigated an intermediate, non- 
Galois cover of ( g ) , which is the fiber product of S with the moduli space of configurations of 
six points of P^ one of which is distinguished. We may put this point at infinity, and consider 
the remaining five as points on the affine line A^; in keeping with our earlier notation we thus 
call this moduli space ("^ ). Using the £ picture of S, and specializing the formulas of [3h|, 
we have shown that the resulting G-cover of ( ) is still rational, and is geometrically even 
nicer than S: it is the complement of hyperplanes in P^ = P{V)l The representation of 2.G 
on V yields the action of G on this projective space, and the excluded hyperplanes are the 
40 orthogonal complements of the roots of E^. We shall show this in a future paper; the 
computations there will also verify the claim in the previous paragraph that k{ ^/A{S)) C ks- 

Degenerations. In the same paper we shall also describe the behavior of the configuration of 
the tj, and thus also of J(G)[3], J(G')[2] and f , as we approach the excluded hyperplanes. 
For instance, we find that as two of the five finite tj approach one another, the point on P'^ 
approaches one of the hyperplanes, while as some tj — s- oo the point approaches one of the 
40 (projectivizations of) roots, which must be blown up to detect the configuration of the 
remaining four tj. Perhaps most strikingly, we obtain the moduli space ^2(3) of principally 
polarized abelian surfaces with full lcvel-3 structure by blowing up P^ at the 90 lines such as 
zi — Z2 — — that contain four of the 40 projectivized roots: .42(3) is the complement in 
that blown-up P'^ of the proper transforms of the 40 hyperplanes. 
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